Overview

Dataset info

Number of variables33
Number of observations569
Missing cells569 (3.0%)
Duplicate rows0 (0.0%)
Total size in memory146.8 KiB
Average record size in memory264.1 B

Variables types

Numeric21
Categorical1
Boolean0
Date0
URL0
Text (Unique)0
Rejected11
Unsupported0

Warnings

area_worst is highly correlated with area_mean (ρ = 0.9592133256) Rejected
concave_points_mean has 13 (2.3%) zeros Zeros
concave_points_se has 13 (2.3%) zeros Zeros
concave_points_worst is highly correlated with concave_points_mean (ρ = 0.9101553143) Rejected
concavity_mean is highly correlated with concave_points_mean (ρ = 0.9213910264) Rejected
concavity_se has 13 (2.3%) zeros Zeros
concavity_worst has 13 (2.3%) zeros Zeros
perimeter_mean is highly correlated with area_worst (ρ = 0.941549808) Rejected
perimeter_se is highly correlated with area_se (ρ = 0.937655407) Rejected
perimeter_worst is highly correlated with perimeter_mean (ρ = 0.970386887) Rejected
radius_mean is highly correlated with perimeter_worst (ρ = 0.965136514) Rejected
radius_se is highly correlated with perimeter_se (ρ = 0.972793677) Rejected
radius_worst is highly correlated with radius_mean (ρ = 0.9695389726) Rejected
texture_worst is highly correlated with texture_mean (ρ = 0.9120445888) Rejected
Unnamed_32 has constant value "nan" Rejected

Variables

area_mean
Numeric

Distinct count539
Unique (%)94.7%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean654.8891037
Minimum143.5
Maximum2501
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum143.5
5-th percentile275.78
Q1420.3
Median551.1
Q3782.7
95-th percentile1309.8
Maximum2501
Range2357.5
Interquartile range362.4

Descriptive statistics

Standard deviation351.9141292
Coef of variation0.5373644594
Kurtosis3.652302762
Mean654.8891037
MAD263.4833843
Skewness1.645732176
Sum372631.9
Variance123843.5543
Memory size4.5 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 143.5 221.25 358.25 603.15 718.2 1330.5 1801. 2501. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
512.2 3 0.5%
 
1214 2 0.4%
 
399.8 2 0.4%
 
758.6 2 0.4%
 
1075 2 0.4%
 
372.7 2 0.4%
 
684.5 2 0.4%
 
716.6 2 0.4%
 
1138 2 0.4%
 
658.8 2 0.4%
 
Other values (529) 548 96.3%
 

Minimum 5 values

ValueCountFrequency (%) 
143.5 1 0.2%
 
170.4 1 0.2%
 
178.8 1 0.2%
 
181 1 0.2%
 
201.9 1 0.2%
 

Maximum 5 values

ValueCountFrequency (%) 
2501 1 0.2%
 
2499 1 0.2%
 
2250 1 0.2%
 
2010 1 0.2%
 
1878 1 0.2%
 

area_se
Numeric

Distinct count528
Unique (%)92.8%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean40.33707909
Minimum6.802
Maximum542.2
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum6.802
5-th percentile11.36
Q117.85
Median24.53
Q345.19
95-th percentile115.8
Maximum542.2
Range535.398
Interquartile range27.34

Descriptive statistics

Standard deviation45.49100552
Coef of variation1.127771434
Kurtosis49.20907651
Mean40.33707909
MAD27.19650653
Skewness5.447186285
Sum22951.798
Variance2069.431583
Memory size4.5 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 6.802 11.185 25.21 35.185 54.2 106.2 178.35 542.2 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
17.67 3 0.5%
 
16.64 3 0.5%
 
18.54 3 0.5%
 
16.97 3 0.5%
 
14.91 2 0.4%
 
19.53 2 0.4%
 
12.67 2 0.4%
 
44.41 2 0.4%
 
17.86 2 0.4%
 
20.56 2 0.4%
 
Other values (518) 545 95.8%
 

Minimum 5 values

ValueCountFrequency (%) 
6.802 1 0.2%
 
7.228 1 0.2%
 
7.254 1 0.2%
 
7.326 1 0.2%
 
8.205 1 0.2%
 

Maximum 5 values

ValueCountFrequency (%) 
542.2 1 0.2%
 
525.6 1 0.2%
 
233 1 0.2%
 
224.1 1 0.2%
 
199.7 1 0.2%
 

area_worst
Highly correlated

This variable is highly correlated with area_mean and should be ignored for analysis

Correlation0.9592133256

compactness_mean
Numeric

Distinct count537
Unique (%)94.4%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.1043409842
Minimum0.01938
Maximum0.3454
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum0.01938
5-th percentile0.04066
Q10.06492
Median0.09263
Q30.1304
95-th percentile0.2087
Maximum0.3454
Range0.32602
Interquartile range0.06548

Descriptive statistics

Standard deviation0.05281275793
Coef of variation0.5061554512
Kurtosis1.650130467
Mean0.1043409842
MAD0.04110524022
Skewness1.190123031
Sum59.37002
Variance0.0027891874
Memory size4.5 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.01938 0.033955 0.049685 0.086015 0.13425 0.17095 0.24355 0.3454 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.1206 3 0.5%
 
0.1147 3 0.5%
 
0.04994 2 0.4%
 
0.1339 2 0.4%
 
0.1267 2 0.4%
 
0.2087 2 0.4%
 
0.07722 2 0.4%
 
0.07698 2 0.4%
 
0.03834 2 0.4%
 
0.1599 2 0.4%
 
Other values (527) 547 96.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0.01938 1 0.2%
 
0.02344 1 0.2%
 
0.0265 1 0.2%
 
0.02675 1 0.2%
 
0.03116 1 0.2%
 

Maximum 5 values

ValueCountFrequency (%) 
0.3454 1 0.2%
 
0.3114 1 0.2%
 
0.2867 1 0.2%
 
0.2839 1 0.2%
 
0.2832 1 0.2%
 

compactness_se
Numeric

Distinct count541
Unique (%)95.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.02547813884
Minimum0.002252
Maximum0.1354
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum0.002252
5-th percentile0.0078922
Q10.01308
Median0.02045
Q30.03245
95-th percentile0.060578
Maximum0.1354
Range0.133148
Interquartile range0.01937

Descriptive statistics

Standard deviation0.01790817933
Coef of variation0.702884125
Kurtosis5.106252483
Mean0.02547813884
MAD0.01313842534
Skewness1.90222071
Sum14.497061
Variance0.0003207028868
Memory size4.5 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.002252 0.0079 0.018195 0.035 0.04957 0.07557 0.1354 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.0231 3 0.5%
 
0.01104 3 0.5%
 
0.01812 3 0.5%
 
0.02219 2 0.4%
 
0.01877 2 0.4%
 
0.01382 2 0.4%
 
0.01395 2 0.4%
 
0.01587 2 0.4%
 
0.009169 2 0.4%
 
0.0118 2 0.4%
 
Other values (531) 546 96.0%
 

Minimum 5 values

ValueCountFrequency (%) 
0.002252 1 0.2%
 
0.003012 1 0.2%
 
0.00371 1 0.2%
 
0.003746 1 0.2%
 
0.00466 1 0.2%
 

Maximum 5 values

ValueCountFrequency (%) 
0.1354 1 0.2%
 
0.1064 1 0.2%
 
0.1006 1 0.2%
 
0.09806 1 0.2%
 
0.09586 1 0.2%
 

compactness_worst
Numeric

Distinct count529
Unique (%)93.0%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.2542650439
Minimum0.02729
Maximum1.058
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum0.02729
5-th percentile0.071196
Q10.1472
Median0.2119
Q30.3391
95-th percentile0.56412
Maximum1.058
Range1.03071
Interquartile range0.1919

Descriptive statistics

Standard deviation0.1573364889
Coef of variation0.6187893014
Kurtosis3.039288172
Mean0.2542650439
MAD0.1196837444
Skewness1.4735549
Sum144.67681
Variance0.02475477074
Memory size4.5 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.02729 0.0646 0.25805 0.42735 0.62055 1.058 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.3416 3 0.5%
 
0.1486 3 0.5%
 
0.1822 2 0.4%
 
0.3089 2 0.4%
 
0.1517 2 0.4%
 
0.07348 2 0.4%
 
0.1582 2 0.4%
 
0.4061 2 0.4%
 
0.171 2 0.4%
 
0.3055 2 0.4%
 
Other values (519) 547 96.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0.02729 1 0.2%
 
0.03432 1 0.2%
 
0.04327 1 0.2%
 
0.04619 1 0.2%
 
0.04712 1 0.2%
 

Maximum 5 values

ValueCountFrequency (%) 
1.058 1 0.2%
 
0.9379 1 0.2%
 
0.9327 1 0.2%
 
0.8681 1 0.2%
 
0.8663 1 0.2%
 

concave_points_mean
Numeric

Distinct count542
Unique (%)95.3%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.04891914587
Minimum0
Maximum0.2012
Zeros (%)2.3%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0.0056208
Q10.02031
Median0.0335
Q30.074
95-th percentile0.12574
Maximum0.2012
Range0.2012
Interquartile range0.05369

Descriptive statistics

Standard deviation0.03880284486
Coef of variation0.7932036459
Kurtosis1.066555703
Mean0.04891914587
MAD0.03145979977
Skewness1.171180081
Sum27.834994
Variance0.001505660769
Memory size4.5 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0. 0.000926 0.011065 0.03395 0.097735 0.1512 0.2012 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 13 2.3%
 
0.02864 3 0.5%
 
0.1242 2 0.4%
 
0.05252 2 0.4%
 
0.05778 2 0.4%
 
0.02031 2 0.4%
 
0.01924 2 0.4%
 
0.01615 2 0.4%
 
0.02594 2 0.4%
 
0.1471 2 0.4%
 
Other values (532) 537 94.4%
 

Minimum 5 values

ValueCountFrequency (%) 
0 13 2.3%
 
0.001852 1 0.2%
 
0.002404 1 0.2%
 
0.002924 1 0.2%
 
0.002941 1 0.2%
 

Maximum 5 values

ValueCountFrequency (%) 
0.2012 1 0.2%
 
0.1913 1 0.2%
 
0.1878 1 0.2%
 
0.1845 1 0.2%
 
0.1823 1 0.2%
 

concave_points_se
Numeric

Distinct count507
Unique (%)89.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.01179613708
Minimum0
Maximum0.05279
Zeros (%)2.3%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0.0038308
Q10.007638
Median0.01093
Q30.01471
95-th percentile0.022884
Maximum0.05279
Range0.05279
Interquartile range0.007072

Descriptive statistics

Standard deviation0.006170285174
Coef of variation0.5230767607
Kurtosis5.126301943
Mean0.01179613708
MAD0.004527923166
Skewness1.444678145
Sum6.712002
Variance3.807241913e-05
Memory size4.5 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0. 0.0049695 0.01391 0.01914 0.02886 0.05279 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 13 2.3%
 
0.0111 3 0.5%
 
0.01167 3 0.5%
 
0.01499 3 0.5%
 
0.01841 2 0.4%
 
0.01043 2 0.4%
 
0.012 2 0.4%
 
0.009155 2 0.4%
 
0.0191 2 0.4%
 
0.01262 2 0.4%
 
Other values (497) 535 94.0%
 

Minimum 5 values

ValueCountFrequency (%) 
0 13 2.3%
 
0.001852 1 0.2%
 
0.002386 1 0.2%
 
0.002404 1 0.2%
 
0.002924 1 0.2%
 

Maximum 5 values

ValueCountFrequency (%) 
0.05279 1 0.2%
 
0.0409 1 0.2%
 
0.03927 1 0.2%
 
0.03487 1 0.2%
 
0.03441 1 0.2%
 

concave_points_worst
Highly correlated

This variable is highly correlated with concave_points_mean and should be ignored for analysis

Correlation0.9101553143

concavity_mean
Highly correlated

This variable is highly correlated with concave_points_mean and should be ignored for analysis

Correlation0.9213910264

concavity_se
Numeric

Distinct count533
Unique (%)93.7%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.03189371634
Minimum0
Maximum0.396
Zeros (%)2.3%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0.0032526
Q10.01509
Median0.02589
Q30.04205
95-th percentile0.078936
Maximum0.396
Range0.396
Interquartile range0.02696

Descriptive statistics

Standard deviation0.03018606032
Coef of variation0.9464579166
Kurtosis48.8613953
Mean0.03189371634
MAD0.0186080085
Skewness5.110463049
Sum18.1475246
Variance0.0009111982378
Memory size4.5 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.0000e+00 3.4600e-04 9.9315e-03 3.1310e-02 5.7560e-02 8.1950e-02 1.4865e-01 3.9600e-01], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 13 2.3%
 
0.01376 2 0.4%
 
0.04344 2 0.4%
 
0.018 2 0.4%
 
0.02681 2 0.4%
 
0.02071 2 0.4%
 
0.03872 2 0.4%
 
0.01652 2 0.4%
 
0.02185 2 0.4%
 
0.02664 2 0.4%
 
Other values (523) 538 94.6%
 

Minimum 5 values

ValueCountFrequency (%) 
0 13 2.3%
 
0.000692 1 0.2%
 
0.0007929 1 0.2%
 
0.0009737 1 0.2%
 
0.001128 1 0.2%
 

Maximum 5 values

ValueCountFrequency (%) 
0.396 1 0.2%
 
0.3038 1 0.2%
 
0.1535 1 0.2%
 
0.1438 1 0.2%
 
0.1435 1 0.2%
 

concavity_worst
Numeric

Distinct count539
Unique (%)94.7%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.2721884833
Minimum0
Maximum1.252
Zeros (%)2.3%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0.01836
Q10.1145
Median0.2267
Q30.3829
95-th percentile0.68238
Maximum1.252
Range1.252
Interquartile range0.2684

Descriptive statistics

Standard deviation0.2086242806
Coef of variation0.7664699038
Kurtosis1.615253298
Mean0.2721884833
MAD0.1646995763
Skewness1.150236822
Sum154.875247
Variance0.04352409046
Memory size4.5 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.0000e+00 9.2250e-04 1.9580e-01 4.0345e-01 7.3960e-01 1.2520e+00], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 13 2.3%
 
0.1377 3 0.5%
 
0.4504 3 0.5%
 
0.1804 2 0.4%
 
0.3853 2 0.4%
 
0.4024 2 0.4%
 
0.256 2 0.4%
 
0.1811 2 0.4%
 
0.1564 2 0.4%
 
0.3344 2 0.4%
 
Other values (529) 536 94.2%
 

Minimum 5 values

ValueCountFrequency (%) 
0 13 2.3%
 
0.001845 1 0.2%
 
0.003581 1 0.2%
 
0.004955 1 0.2%
 
0.005518 1 0.2%
 

Maximum 5 values

ValueCountFrequency (%) 
1.252 1 0.2%
 
1.17 1 0.2%
 
1.105 1 0.2%
 
0.9608 1 0.2%
 
0.9387 1 0.2%
 

diagnosis
Categorical

Distinct count2
Unique (%)0.4%
Missing (%)0.0%
Missing (n)0
B
357
M
212
ValueCountFrequency (%) 
B 357 62.7%
 
M 212 37.3%
 
Max length1
Mean length1
Min length1
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

fractal_dimension_mean
Numeric

Distinct count499
Unique (%)87.7%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.06279760984
Minimum0.04996
Maximum0.09744
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum0.04996
5-th percentile0.053926
Q10.0577
Median0.06154
Q30.06612
95-th percentile0.07609
Maximum0.09744
Range0.04748
Interquartile range0.00842

Descriptive statistics

Standard deviation0.007060362795
Coef of variation0.1124304382
Kurtosis3.00589212
Mean0.06279760984
MAD0.005306453526
Skewness1.304488813
Sum35.73184
Variance4.98487228e-05
Memory size4.5 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.04996 0.052285 0.055015 0.063465 0.0692 0.07417 0.08252 0.09744 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.05667 3 0.5%
 
0.06113 3 0.5%
 
0.05913 3 0.5%
 
0.06782 3 0.5%
 
0.05907 3 0.5%
 
0.05715 2 0.4%
 
0.06758 2 0.4%
 
0.06331 2 0.4%
 
0.05866 2 0.4%
 
0.06284 2 0.4%
 
Other values (489) 544 95.6%
 

Minimum 5 values

ValueCountFrequency (%) 
0.04996 1 0.2%
 
0.05024 1 0.2%
 
0.05025 1 0.2%
 
0.05044 1 0.2%
 
0.05054 1 0.2%
 

Maximum 5 values

ValueCountFrequency (%) 
0.09744 1 0.2%
 
0.09575 1 0.2%
 
0.09502 1 0.2%
 
0.09296 1 0.2%
 
0.0898 1 0.2%
 

fractal_dimension_se
Numeric

Distinct count545
Unique (%)95.8%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.003794903866
Minimum0.0008948
Maximum0.02984
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum0.0008948
5-th percentile0.0015216
Q10.002248
Median0.003187
Q30.004558
95-th percentile0.0079598
Maximum0.02984
Range0.0289452
Interquartile range0.00231

Descriptive statistics

Standard deviation0.002646070967
Coef of variation0.6972695647
Kurtosis26.28084749
Mean0.003794903866
MAD0.001664151128
Skewness3.92396862
Sum2.1593003
Variance7.001691563e-06
Memory size4.5 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.0008948 0.0015165 0.0028255 0.004836 0.0062265 0.008223 0.01291 0.02984 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.002205 2 0.4%
 
0.002887 2 0.4%
 
0.003318 2 0.4%
 
0.001906 2 0.4%
 
0.00456 2 0.4%
 
0.001976 2 0.4%
 
0.003696 2 0.4%
 
0.003224 2 0.4%
 
0.002551 2 0.4%
 
0.001892 2 0.4%
 
Other values (535) 549 96.5%
 

Minimum 5 values

ValueCountFrequency (%) 
0.0008948 1 0.2%
 
0.0009502 1 0.2%
 
0.0009683 1 0.2%
 
0.001002 1 0.2%
 
0.001058 1 0.2%
 

Maximum 5 values

ValueCountFrequency (%) 
0.02984 1 0.2%
 
0.02286 1 0.2%
 
0.02193 1 0.2%
 
0.01792 1 0.2%
 
0.01298 1 0.2%
 

fractal_dimension_worst
Numeric

Distinct count535
Unique (%)94.0%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.08394581722
Minimum0.05504
Maximum0.2075
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum0.05504
5-th percentile0.062558
Q10.07146
Median0.08004
Q30.09208
95-th percentile0.11952
Maximum0.2075
Range0.15246
Interquartile range0.02062

Descriptive statistics

Standard deviation0.01806126735
Coef of variation0.2151538688
Kurtosis5.244610556
Mean0.08394581722
MAD0.01340966682
Skewness1.662579266
Sum47.76517
Variance0.0003262093782
Memory size4.5 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.05504 0.06419 0.083245 0.09356 0.1085 0.14385 0.2075 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.07427 3 0.5%
 
0.1297 2 0.4%
 
0.07918 2 0.4%
 
0.08633 2 0.4%
 
0.09136 2 0.4%
 
0.07623 2 0.4%
 
0.08174 2 0.4%
 
0.0895 2 0.4%
 
0.06915 2 0.4%
 
0.1023 2 0.4%
 
Other values (525) 548 96.3%
 

Minimum 5 values

ValueCountFrequency (%) 
0.05504 1 0.2%
 
0.05521 1 0.2%
 
0.05525 1 0.2%
 
0.05695 1 0.2%
 
0.05737 1 0.2%
 

Maximum 5 values

ValueCountFrequency (%) 
0.2075 1 0.2%
 
0.173 1 0.2%
 
0.1486 1 0.2%
 
0.1446 1 0.2%
 
0.1431 1 0.2%
 

id
Numeric

Distinct count569
Unique (%)100.0%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean30371831.43
Minimum8670
Maximum911320502
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum8670
5-th percentile90267
Q1869218
Median906024
Q38813129
95-th percentile90424461.4
Maximum911320502
Range911311832
Interquartile range7943911

Descriptive statistics

Standard deviation125020585.6
Coef of variation4.116333448
Kurtosis42.19319416
Mean30371831.43
MAD47795087.13
Skewness6.473751802
Sum1.728157208e+10
Variance1.563014683e+16
Memory size4.5 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[8.67000000e+03 8.98100000e+03 8.59615000e+04 9.23045000e+04 8.42409500e+05 ... 9.19550515e+07 8.71001502e+08 9.11226752e+08 9.11320502e+08 9.11320502e+08], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
883263 1 0.2%
 
906564 1 0.2%
 
89122 1 0.2%
 
9013579 1 0.2%
 
868682 1 0.2%
 
859465 1 0.2%
 
859464 1 0.2%
 
911685 1 0.2%
 
895299 1 0.2%
 
909220 1 0.2%
 
Other values (559) 559 98.2%
 

Minimum 5 values

ValueCountFrequency (%) 
8670 1 0.2%
 
8913 1 0.2%
 
8915 1 0.2%
 
9047 1 0.2%
 
85715 1 0.2%
 

Maximum 5 values

ValueCountFrequency (%) 
911320502 1 0.2%
 
911320501 1 0.2%
 
911296202 1 0.2%
 
911296201 1 0.2%
 
911157302 1 0.2%
 

perimeter_mean
Highly correlated

This variable is highly correlated with area_worst and should be ignored for analysis

Correlation0.941549808

perimeter_se
Highly correlated

This variable is highly correlated with area_se and should be ignored for analysis

Correlation0.937655407

perimeter_worst
Highly correlated

This variable is highly correlated with perimeter_mean and should be ignored for analysis

Correlation0.970386887

radius_mean
Highly correlated

This variable is highly correlated with perimeter_worst and should be ignored for analysis

Correlation0.965136514

radius_se
Highly correlated

This variable is highly correlated with perimeter_se and should be ignored for analysis

Correlation0.972793677

radius_worst
Highly correlated

This variable is highly correlated with radius_mean and should be ignored for analysis

Correlation0.9695389726

smoothness_mean
Numeric

Distinct count474
Unique (%)83.3%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.0963602812
Minimum0.05263
Maximum0.1634
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum0.05263
5-th percentile0.075042
Q10.08637
Median0.09587
Q30.1053
95-th percentile0.11878
Maximum0.1634
Range0.11077
Interquartile range0.01893

Descriptive statistics

Standard deviation0.01406412814
Coef of variation0.1459535813
Kurtosis0.8559749304
Mean0.0963602812
MAD0.01116098863
Skewness0.4563237648
Sum54.829
Variance0.0001977997003
Memory size4.5 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.05263 0.068555 0.079315 0.10755 0.1171 0.12885 0.1634 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.1007 5 0.9%
 
0.1075 4 0.7%
 
0.1054 4 0.7%
 
0.115 4 0.7%
 
0.1089 3 0.5%
 
0.1037 3 0.5%
 
0.09462 3 0.5%
 
0.1049 3 0.5%
 
0.08511 3 0.5%
 
0.1066 3 0.5%
 
Other values (464) 534 93.8%
 

Minimum 5 values

ValueCountFrequency (%) 
0.05263 1 0.2%
 
0.06251 1 0.2%
 
0.06429 1 0.2%
 
0.06576 1 0.2%
 
0.06613 1 0.2%
 

Maximum 5 values

ValueCountFrequency (%) 
0.1634 1 0.2%
 
0.1447 1 0.2%
 
0.1425 1 0.2%
 
0.1398 1 0.2%
 
0.1371 1 0.2%
 

smoothness_se
Numeric

Distinct count547
Unique (%)96.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.00704097891
Minimum0.001713
Maximum0.03113
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum0.001713
5-th percentile0.0036902
Q10.005169
Median0.00638
Q30.008146
95-th percentile0.012644
Maximum0.03113
Range0.029417
Interquartile range0.002977

Descriptive statistics

Standard deviation0.003002517944
Coef of variation0.4264347305
Kurtosis10.46983953
Mean0.00704097891
MAD0.002122905662
Skewness2.314450057
Sum4.006317
Variance9.015114003e-06
Memory size4.5 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.001713 0.003154 0.0040925 0.0068065 0.008135 0.010975 0.01593 0.03113 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.01052 2 0.4%
 
0.00604 2 0.4%
 
0.005884 2 0.4%
 
0.007595 2 0.4%
 
0.005251 2 0.4%
 
0.006399 2 0.4%
 
0.01017 2 0.4%
 
0.006494 2 0.4%
 
0.01 2 0.4%
 
0.01038 2 0.4%
 
Other values (537) 549 96.5%
 

Minimum 5 values

ValueCountFrequency (%) 
0.001713 1 0.2%
 
0.002667 1 0.2%
 
0.002826 1 0.2%
 
0.002838 1 0.2%
 
0.002866 1 0.2%
 

Maximum 5 values

ValueCountFrequency (%) 
0.03113 1 0.2%
 
0.02333 1 0.2%
 
0.02177 1 0.2%
 
0.02075 1 0.2%
 
0.01835 1 0.2%
 

smoothness_worst
Numeric

Distinct count411
Unique (%)72.2%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.132368594
Minimum0.07117
Maximum0.2226
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum0.07117
5-th percentile0.095734
Q10.1166
Median0.1313
Q30.146
95-th percentile0.17184
Maximum0.2226
Range0.15143
Interquartile range0.0294

Descriptive statistics

Standard deviation0.0228324294
Coef of variation0.172491289
Kurtosis0.5178251903
Mean0.132368594
MAD0.01795559947
Skewness0.4154259963
Sum75.31773
Variance0.0005213198325
Memory size4.5 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.07117 0.09235 0.107 0.15625 0.19055 0.2226 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.1216 4 0.7%
 
0.1312 4 0.7%
 
0.1275 4 0.7%
 
0.1256 4 0.7%
 
0.1415 4 0.7%
 
0.1223 4 0.7%
 
0.1234 4 0.7%
 
0.1401 4 0.7%
 
0.1347 4 0.7%
 
0.1316 3 0.5%
 
Other values (401) 530 93.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0.07117 1 0.2%
 
0.08125 1 0.2%
 
0.08409 1 0.2%
 
0.08484 1 0.2%
 
0.08567 1 0.2%
 

Maximum 5 values

ValueCountFrequency (%) 
0.2226 1 0.2%
 
0.2184 1 0.2%
 
0.2098 1 0.2%
 
0.2006 1 0.2%
 
0.1909 1 0.2%
 

symmetry_mean
Numeric

Distinct count432
Unique (%)75.9%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.1811618629
Minimum0.106
Maximum0.304
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum0.106
5-th percentile0.1415
Q10.1619
Median0.1792
Q30.1957
95-th percentile0.23072
Maximum0.304
Range0.198
Interquartile range0.0338

Descriptive statistics

Standard deviation0.02741428134
Coef of variation0.1513247926
Kurtosis1.287932992
Mean0.1811618629
MAD0.02114575814
Skewness0.7256089734
Sum103.0811
Variance0.0007515428212
Memory size4.5 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.106 0.1338 0.15055 0.19785 0.22035 0.2596 0.304 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.1714 4 0.7%
 
0.1769 4 0.7%
 
0.1893 4 0.7%
 
0.1717 4 0.7%
 
0.1601 4 0.7%
 
0.2116 3 0.5%
 
0.193 3 0.5%
 
0.1619 3 0.5%
 
0.1861 3 0.5%
 
0.172 3 0.5%
 
Other values (422) 534 93.8%
 

Minimum 5 values

ValueCountFrequency (%) 
0.106 1 0.2%
 
0.1167 1 0.2%
 
0.1203 1 0.2%
 
0.1215 1 0.2%
 
0.122 1 0.2%
 

Maximum 5 values

ValueCountFrequency (%) 
0.304 1 0.2%
 
0.2906 1 0.2%
 
0.2743 1 0.2%
 
0.2678 1 0.2%
 
0.2655 1 0.2%
 

symmetry_se
Numeric

Distinct count498
Unique (%)87.5%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.02054229877
Minimum0.007882
Maximum0.07895
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum0.007882
5-th percentile0.011758
Q10.01516
Median0.01873
Q30.02348
95-th percentile0.034988
Maximum0.07895
Range0.071068
Interquartile range0.00832

Descriptive statistics

Standard deviation0.008266371529
Coef of variation0.4024073265
Kurtosis7.896129828
Mean0.02054229877
MAD0.005819244146
Skewness2.1951329
Sum11.688568
Variance6.833289825e-05
Memory size4.5 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.007882 0.010545 0.013155 0.021065 0.02797 0.03525 0.04523 0.07895 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.01344 4 0.7%
 
0.01536 3 0.5%
 
0.01897 3 0.5%
 
0.01884 3 0.5%
 
0.02045 3 0.5%
 
0.0187 3 0.5%
 
0.01924 3 0.5%
 
0.01454 3 0.5%
 
0.01647 3 0.5%
 
0.01719 2 0.4%
 
Other values (488) 539 94.7%
 

Minimum 5 values

ValueCountFrequency (%) 
0.007882 1 0.2%
 
0.009539 1 0.2%
 
0.009947 1 0.2%
 
0.01013 1 0.2%
 
0.01029 1 0.2%
 

Maximum 5 values

ValueCountFrequency (%) 
0.07895 1 0.2%
 
0.06146 1 0.2%
 
0.05963 1 0.2%
 
0.05628 1 0.2%
 
0.05543 1 0.2%
 

symmetry_worst
Numeric

Distinct count500
Unique (%)87.9%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.2900755712
Minimum0.1565
Maximum0.6638
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum0.1565
5-th percentile0.2127
Q10.2504
Median0.2822
Q30.3179
95-th percentile0.40616
Maximum0.6638
Range0.5073
Interquartile range0.0675

Descriptive statistics

Standard deviation0.06186746754
Coef of variation0.2132805161
Kurtosis4.444559518
Mean0.2900755712
MAD0.04487112098
Skewness1.433927765
Sum165.053
Variance0.00382758354
Memory size4.5 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.1565 0.2095 0.22155 0.33225 0.3702 0.48725 0.6638 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.2226 3 0.5%
 
0.2369 3 0.5%
 
0.2383 3 0.5%
 
0.2972 3 0.5%
 
0.3196 3 0.5%
 
0.3109 3 0.5%
 
0.2694 2 0.4%
 
0.3103 2 0.4%
 
0.2404 2 0.4%
 
0.3187 2 0.4%
 
Other values (490) 543 95.4%
 

Minimum 5 values

ValueCountFrequency (%) 
0.1565 1 0.2%
 
0.1566 1 0.2%
 
0.1603 1 0.2%
 
0.1648 1 0.2%
 
0.1652 1 0.2%
 

Maximum 5 values

ValueCountFrequency (%) 
0.6638 1 0.2%
 
0.5774 1 0.2%
 
0.5558 1 0.2%
 
0.544 1 0.2%
 
0.5166 1 0.2%
 

texture_mean
Numeric

Distinct count479
Unique (%)84.2%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean19.28964851
Minimum9.71
Maximum39.28
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum9.71
5-th percentile13.088
Q116.17
Median18.84
Q321.8
95-th percentile27.15
Maximum39.28
Range29.57
Interquartile range5.63

Descriptive statistics

Standard deviation4.301035768
Coef of variation0.2229711841
Kurtosis0.7583189724
Mean19.28964851
MAD3.38496465
Skewness0.6504495421
Sum10975.81
Variance18.49890868
Memory size4.5 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 9.71 12.725 14.61 22.545 25.525 29.89 39.28 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
14.93 3 0.5%
 
15.7 3 0.5%
 
18.9 3 0.5%
 
16.84 3 0.5%
 
17.46 3 0.5%
 
18.22 3 0.5%
 
20.52 3 0.5%
 
16.85 3 0.5%
 
19.83 3 0.5%
 
18.89 2 0.4%
 
Other values (469) 540 94.9%
 

Minimum 5 values

ValueCountFrequency (%) 
9.71 1 0.2%
 
10.38 1 0.2%
 
10.72 1 0.2%
 
10.82 1 0.2%
 
10.89 1 0.2%
 

Maximum 5 values

ValueCountFrequency (%) 
39.28 1 0.2%
 
33.81 1 0.2%
 
33.56 1 0.2%
 
32.47 1 0.2%
 
31.12 1 0.2%
 

texture_se
Numeric

Distinct count519
Unique (%)91.2%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean1.216853427
Minimum0.3602
Maximum4.885
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum0.3602
5-th percentile0.54014
Q10.8339
Median1.108
Q31.474
95-th percentile2.212
Maximum4.885
Range4.5248
Interquartile range0.6401

Descriptive statistics

Standard deviation0.5516483926
Coef of variation0.4533400493
Kurtosis5.349168692
Mean1.216853427
MAD0.4087402152
Skewness1.646443809
Sum692.3896
Variance0.3043159491
Memory size4.5 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.3602 0.62115 1.5105 1.976 2.9185 4.885 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1.35 3 0.5%
 
1.268 3 0.5%
 
0.8561 3 0.5%
 
1.15 3 0.5%
 
1.428 2 0.4%
 
0.9429 2 0.4%
 
1.169 2 0.4%
 
1.39 2 0.4%
 
1.166 2 0.4%
 
1.199 2 0.4%
 
Other values (509) 545 95.8%
 

Minimum 5 values

ValueCountFrequency (%) 
0.3602 1 0.2%
 
0.3621 1 0.2%
 
0.3628 1 0.2%
 
0.3871 1 0.2%
 
0.3981 1 0.2%
 

Maximum 5 values

ValueCountFrequency (%) 
4.885 1 0.2%
 
3.896 1 0.2%
 
3.647 1 0.2%
 
3.568 1 0.2%
 
3.12 1 0.2%
 

texture_worst
Highly correlated

This variable is highly correlated with texture_mean and should be ignored for analysis

Correlation0.9120445888

Unnamed_32
Constant

This variable is constant and should be ignored for analysis

Constant valuenan

Correlations

Missing values

Sample

First rows

area_meanarea_searea_worstcompactness_meancompactness_secompactness_worstconcave_points_meanconcave_points_seconcave_points_worstconcavity_meanconcavity_seconcavity_worstdiagnosisfractal_dimension_meanfractal_dimension_sefractal_dimension_worstidperimeter_meanperimeter_seperimeter_worstradius_meanradius_seradius_worstsmoothness_meansmoothness_sesmoothness_worstsymmetry_meansymmetry_sesymmetry_worsttexture_meantexture_setexture_worstUnnamed_32
01001.0153.402019.00.277600.049040.66560.147100.015870.26540.300100.053730.7119M0.078710.0061930.11890842302122.808.589184.6017.991.095025.380.118400.0063990.16220.24190.030030.460110.380.905317.33NaN
11326.074.081956.00.078640.013080.18660.070170.013400.18600.086900.018600.2416M0.056670.0035320.08902842517132.903.398158.8020.570.543524.990.084740.0052250.12380.18120.013890.275017.770.733923.41NaN
21203.094.031709.00.159900.040060.42450.127900.020580.24300.197400.038320.4504M0.059990.0045710.0875884300903130.004.585152.5019.690.745623.570.109600.0061500.14440.20690.022500.361321.250.786925.53NaN
3386.127.23567.70.283900.074580.86630.105200.018670.25750.241400.056610.6869M0.097440.0092080.173008434830177.583.44598.8711.420.495614.910.142500.0091100.20980.25970.059630.663820.381.156026.50NaN
41297.094.441575.00.132800.024610.20500.104300.018850.16250.198000.056880.4000M0.058830.0051150.0767884358402135.105.438152.2020.290.757222.540.100300.0114900.13740.18090.017560.236414.340.781316.67NaN
5477.127.19741.60.170000.033450.52490.080890.011370.17410.157800.036720.5355M0.076130.0050820.1244084378682.572.217103.4012.450.334515.470.127800.0075100.17910.20870.021650.398515.700.890223.75NaN
61040.053.911606.00.109000.013820.25760.074000.010390.19320.112700.022540.3784M0.057420.0021790.08368844359119.603.180153.2018.250.446722.880.094630.0043140.14420.17940.013690.306319.980.773227.66NaN
7577.950.96897.00.164500.030290.36820.059850.014480.15560.093660.024880.2678M0.074510.0054120.115108445820290.203.856110.6013.710.583517.060.118900.0088050.16540.21960.014860.319620.831.377028.14NaN
8519.824.32739.30.193200.035020.54010.093530.012260.20600.185900.035530.5390M0.073890.0037490.1072084498187.502.406106.2013.000.306315.490.127300.0057310.17030.23500.021430.437821.821.002030.73NaN
9475.923.94711.40.239600.072171.05800.085430.014320.22100.227300.077431.1050M0.082430.0100800.207508450100183.972.03997.6512.460.297615.090.118600.0071490.18530.20300.017890.436624.041.599040.68NaN

Last rows

area_meanarea_searea_worstcompactness_meancompactness_secompactness_worstconcave_points_meanconcave_points_seconcave_points_worstconcavity_meanconcavity_seconcavity_worstdiagnosisfractal_dimension_meanfractal_dimension_sefractal_dimension_worstidperimeter_meanperimeter_seperimeter_worstradius_meanradius_seradius_worstsmoothness_meansmoothness_sesmoothness_worstsymmetry_meansymmetry_sesymmetry_worsttexture_meantexture_setexture_worstUnnamed_32
559403.516.97474.20.102100.0298200.251700.041050.012670.096530.111200.057380.3630B0.065700.0047380.0873292529174.521.93682.2811.510.238812.4800.092610.0082000.129800.13880.014880.211223.932.90437.16NaN
560600.429.84706.70.112600.0267800.226400.043040.016260.104800.044620.020710.1326B0.061710.0053040.0832192529291.382.888100.2014.050.364515.3000.099290.0072560.124100.15370.020800.225027.151.49233.17NaN
561386.022.81439.60.035580.0088780.054940.000000.000000.000000.000000.000000.0000B0.055020.0017730.0590592531170.672.04175.1911.200.314111.9200.074490.0075940.092670.10600.019890.156629.373.89638.30NaN
562716.922.65915.00.208700.0484400.791700.094290.016080.235600.255000.073591.1700M0.071520.0061420.14090925622103.402.362128.7015.220.260217.5200.104800.0046250.141700.21280.021370.408930.621.20542.79NaN
5631347.0118.801819.00.223600.0431000.418600.147400.026240.254200.317400.078450.6599M0.068790.0062130.09873926125143.008.758179.1020.920.962224.2900.109900.0063990.140700.21490.020570.292925.091.02629.41NaN
5641479.0158.702027.00.115900.0289100.211300.138900.024540.221600.243900.051980.4107M0.056230.0042390.07115926424142.007.673166.1021.561.176025.4500.111000.0103000.141000.17260.011140.206022.391.25626.40NaN
5651261.099.041731.00.103400.0242300.192200.097910.016780.162800.144000.039500.3215M0.055330.0024980.06637926682131.205.203155.0020.130.765523.6900.097800.0057690.116600.17520.018980.257228.252.46338.25NaN
566858.148.551124.00.102300.0373100.309400.053020.015570.141800.092510.047300.3403M0.056480.0038920.07820926954108.303.425126.7016.600.456418.9800.084550.0059030.113900.15900.013180.221828.081.07534.12NaN
5671265.086.221821.00.277000.0615800.868100.152000.016640.265000.351400.071170.9387M0.070160.0061850.12400927241140.105.772184.6020.600.726025.7400.117800.0065220.165000.23970.023240.408729.331.59539.42NaN
568181.019.15268.60.043620.0046600.064440.000000.000000.000000.000000.000000.0000B0.058840.0027830.070399275147.922.54859.167.760.38579.4560.052630.0071890.089960.15870.026760.287124.541.42830.37NaN